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Abstract — We determine the optimal achievable rate at which 
entanglement can be reliably transmitted when the memoryless 
channel used during transmission is unknown both to sender and 
receiver. To be more precise, we assume that both of them only 
know that the channel belongs to a given set of channels. Thus, 
they have to use encoding and decoding schemes that work well 
for the whole set. 

I. Introduction 

One of the main goals of quantum Shannon theory is the 
determination of optimal transmission rates for various quan- 
tum communication tasks. In contrast to classical information 
theory to every quantum channel we can associate various 
capacities each of which characterizes the optimal rates in a 
specific communication scenario. In this paper we focus on 
the determination of the entanglement transmission capacity 
of quantum compound channels. 

The correct formula describing this capacity for a single chan- 
nel has been identified in [1], [5], [9]. Of particular interest 
for our work are the later on developments by Klesse [7] and 
Hayden, Horodecki, Winter and Yard [6] which are based on 
a decoupling idea that can be traced back to Schumacher and 
Westmoreland [8]. 

We use their approach to determine the optimal achievable en- 
tanglement transmission rate under channel uncertainty: while 
sustaining the assumption of memoryless communication, we 
assume that sender as well as receiver only know that the 
channel they use belongs to some given set of channels. 
This describes a somewhat more realistic situation since exact 
channel knowledge will hardly ever be given in applications. 
Due to space limitation we will only give the proof of the 
direct part of the coding theorem for finite compound channels. 
The extension to the general case, the proof of the converse 
part and the relation to the entanglement-generating capacity 
of compound channels can be picked up in the accompanying 
paper [4]. 

The paper is organized as follows: We first fix the notation 
in section [TT] In section [III] we introduce our model and state 
the main theorem. Section ITVl contains two results concerning 
existence of recovery operations of a certain performance 
and behavior of entanglement fidelity under disturbance of a 
channel through a projection. The proof of our main theorem 



further uses some basic properties of typical projections and 
operations, which are stated in section [V] From there we pass 
on to the proof of our main theorem in Section [Vl] 

II. Notation and Conventions 

All Hilbert spaces are assumed to have finite dimension 
and are over the field C. S(H) is the set of states, i.e. positive 
semi-definite operators with trace 1 acting on the Hilbert space 
H. Pure states are given by projections onto one-dimensional 
subspaces. A vector of unit length spanning such a subspace 
will therefore be referred to as a state vector. 
The set of completely positive trace preserving (CPTP) maps 
between the operator spaces B(H) and B(JC) is denoted by 
C(H,!C). C^(Tt,]C) stands for the set of completely positive 
trace decreasing maps between B(H) and B(JC). il(7Y) will 
denote in what follows the group of unitary operators acting 
on Tt. For a Hilbert space Q C Ti we will always identify 
ii(G) with a subgroup of it(W) in the canonical way. For any 
projection q £ B(H) we set q 1 - := 1« — q. Each projection 
q G B(H) defines a completely positive trace decreasing map 
Q given by Q(a) := qaq for all a G B(TC). In a similar fashion 
any u G il(7i) defines a U G C(7i,7i) by U(a) := uau* for 
aeB{H). 

We use the base two logarithm which is denoted by log. 
The von Neumann entropy of a state p G S(H) is given 
by S(p) :— — tr(plogp). The coherent information for TV G 
C(H,K.) and p G S(H) is defined by I c (p,N) := S(Af(p)) - 
S{{id n ® N){\^){i)\)), where ijj G U <g> H is an arbitrary 
purification of the state p. Following the usual conventions 
we let S e (p,Af) := S({id H (g>Af)(\ip}(i>\)) denote the entropy 
exchange. 

For p G S(Ti) and Af G C^(H, JC) the entanglement Fidelity 
is given by F e (p,J\f) := (ip,(id n <g) J\f)(\tl))(ip\)ip), with 
ip G H <£) H being an arbitrary purification of the state p. 
In the following, a compound channel is identified with the set 
3 C C{7i,JC) of its constituents. It is called finite if 3 consists 
of finitely many elements. 

III. Codes, Capacity and Main Result 

An (I, ki)— entanglement transmission code for the com- 
pound channel 3 is a pair (V 1 of CPTP maps V 1 G 



C(J-uH® 1 ) where Ti is a Hilbert space with ki = dimJ 7 ; 
and TL l G C(/C®', T[) with ^ c .Fjf. 

A nonnegative number R is called an achievable rate for 
(entanglement transmission through) 3 if there is a sequence 
of (I, ki) -entanglement transmission codes such that 

1) liminfj^oo j logfc; > R, and 

2) limi^inf^ F^n^ll 1 ojV®* op 1 ) = 1. 

The entanglement transmission capacity Q{3) of the com- 
pound channel 3 is given by 

Q(3) := sup{7? G R + : R is achievable for 3}. 

Our main result can now be formulated as follows: 

Theorem 3.1: Let 3 C C(7i,/C) be a compound channel. 
The entanglement transmission capacity of 3 is given by 

1 



Q(3) = lim — max inf I c (p. 

Remark . Corresponding results for the entanglement transmis- 
sion capacity of a compound channel with informed encoder or 
informed decoder can be found in [4]. It is a remarkable fact 
that the proof of the coding theorem for an informed encoder 
is not, as in the classical case, just a trivial modification of 
the one for Theorem 15.71 

IV. One-Shot Results 

This section contains essentially two statements. The first 
gives an estimate on the performance of universal recovery 
operations for a given finite set of channels. The second relates 
the entanglement fidelity of a coding-decoding procedure to 
that of a disturbed version of the procedure, where disturbance 
means application of a projection after using the channel. 
Both results give rather loose bounds that become sharp 
enough only in the asymptotic limit. 

A. Performance of Recovery Operations 

Before we turn our attention to quantum compound channels 
we will shortly describe a part of recent developments in cod- 
ing theory for single (i.e. perfectly known) channels as given in 
[7] and [6]. Both approaches are based on a decoupling idea 
which is closely related to approximate error correction. In 
order to state this decoupling lemma we need some notational 
preparation. 

Let p E S(fi) be given and consider any purification ip E 
TC a <8 Ti-a = T~C, of p. According to Stinespring's represen- 
tation theorem any Af E C^(7i,/C) is given by 



■A/"( ' ) =tr Wa ((l«®p e M ■ )«*)> 



(1) 



where H e is a suitable finite-dimensional Hilbert space, p e is 
a projection onto a subspace of H e , and v : H — > JC <g> Tt e is 
an isometry. 

Let us define a pure state on H. a <8> K <£> Tie by the formula 
y/ := ; \ ri == (l Ha ® K <8> p e )(lw„ ® v)ip. 



VMMp)) 



We set 



Pa :=K K bh.W)W\), Pe :=*h.%k{WW\). 

The announced decoupling lemma can now be stated as 
follows. 

Lemma 4.1 (Cf. [7], [6]): For any TV G C l {H,IC) there 
exists a recovery operation 1Z G C(K,,H) with 

F e (p,KoAf) >W - \\wp' ae -Wp a ® p'eWx, 

where w = tr(Af(p)). 

We will make use of this lemma in the proof of the following 
theorem, which is the heart of the proof of Theorem 13.11 In 
order to state the theorem, we need to introduce the code 
entanglement fidelity which is, for p G S(H),Af G C(H,IC) 
(referring to p as the code) given by 

F c , e (p,AT) := max F e (p,KoN). 

Theorem 4.1 (One-Shot Result for Averaged Channel): 
Let the Hilbert space H be given and consider subspaces 
£ C Q C 7i with dim£ = k. For any choice of 
A/i, . . . ,7Vjv G C^(H,IC) each allowing a representation 
with rij Kraus operators, j — 1, ...,7V, and and for any 
u E il(G) we set 



1 N 



1 



N 



— S^MjoU. Then 
N ^ J 



F c , e {-K£,Nu)du > tr(AA(7r g )) - 2 




I PVj frills 



where the integration is with respect to the normalized Haar 
measure on U(G) and TT£,irg are the maximally mixed states 
on £ and Q. 

Remark . The above Theorem gives a lower bound on the code 
entanglement fidelity of an averaged channel. Since entangle- 
ment fidelity is affine in the operation, F e (TT^ n jj X^ili-^?' 



V ) > 1 — e; implies F e (jrFn 



> V 1 ) > I- Net for every 



i E {1, . . . , N}. If 3 is finite and e; becomes arbitrarily small 
for good codes, this Theorem gives a sufficient estimate. The 
case of general 3 exploits the difference in polynomial growth 
of the number TV; of approximating channels for 3 versus 
exponential decay of ej. 

For the proof of this Theorem, we shall need the following 
two lemmata: 

Lemma 4.2 (Cf. [3 ]): Let L and D be N x N matrices with 
non-negative entries which satisfy 



jl ^ L,jj, L,ji ^ ijiu and Dji < max{Djj,D u } (2) 



Lji < Ljj, Lji < L 
for all j,l E {1,.. ., N}. Then 

N 



v 1 

N 

j,/=i 



iV 



3 = 1 



p' :=tr Wa ®w.(|^><^|), 



= frx:(|V')<^l), 



Lemma 4.3 (Cf. [4]): Let £ and £7 be subspaces of 7i with 
£ E\ Q E\ ri where k := dim£, dg := dimQ. p and 
will denote the orthogonal projections onto £ and Q. For a 



Haar distributed random variable U with values in U(G) and 
x, y G B(H) we define a random sesquilinear form 

b UpU ,(x,y) := \x{UpU*x*UpU*y)~tx{UpU*x*)tx{UpU*y). 



where 



D 3l {u) 



Then 



T -{b UpU *(x,y)} 



1 



d? - 1 
l-k 2 



tx(pgx*p g y) 

tx(pgx*)tx(p g y). 



d(d 2 1) 

Proof of Theorem 14.71 We can assume without loss of gen- 
erality that the numbering of the channels is chosen in such 
a way that n\ < ri2 < . . . < holds for the numbers of 
Kraus operators of the maps A/i, . . . ,AFn. From Lemma |4~T1 
we know that for every u G U(G) there is a recovery operation 
1Z such that 



F e (-K£,1loJ\f u ) >w- \\wp' ae - wp a 



(3) 



where we have used the notation introduced in the paragraph 
preceding Lemma 14.11 and the states on the RHS of equation 
(O now depend on u. 

For each j G {1, . . . , N} let {bj be the set of Kraus 
operators of J\fj. Then J\fj oU has Kraus operators {OjVi}i=i 



given by cij 



bj.iu. Let ...,/iv} and {e x ,. 



} 



be arbitrary orthonormal bases of C N and C" N with only 
imposed restriction that e% ® /i = Let the projection p e 
and unitary u in ([TJ be chosen in such a way that for each 
d> G H the relation 



7 — 1 2 — 1 V 



(b j , i (t>)®ei®f j , (4) 



holds. For a purification ^ G 7i a ® W of the state 7Tg we 
consider a Schmidt representation 



; fin 



with suitable orthonormal systems {hi, . . . , hk} and 
■ ■ -,9k}- 

We use this representation to derive explicit representations 
of the states p' ae , p a , p' e in terms of the Kraus operators 
of the operations Mi and insert them into (0. If we 
perform the unitary conjugation induced by the unitary map 



h, 



s,h3 



fj followed by 



the complex conjugation of the matrix elements with respect 
to the matrix units {\x' s ij)(x' t k i\} s ,i,j,t,k,l we obtain an 
anti-linear isometry I with respect to the metrics induced by 
the trace distances on the operator spaces under consideration. 
A calculation identical to that performed by Klesse [7] and 
additionally using the triangle inequality for 1 1 • 1 1 1 as well as 
the relation ||a||i < vd||a||2, d being the number of non-zero 
singular values of the operator a shows that 



F c , e (ir £ ,j\fu) > tY(M u {n £ )) 



N 



3,1=1 



E 

i—l.r—l 



{\i{p(a* A ai t r)*pa* ,air) - 7 |tr(pa* ,a;, r )| 2 ) 



(dependence on u is through atj = biju) and Lji := 
min{rij, n{\. 

Let U be a random variable taking values in il(G) according 
to the Haar measure of 01(G)- Then we can infer from that 



EF c . e (TT £ ,AfoU) 



> 



Etr(/V oU(n £ )) 

N 



(6) 



-E^Vi^ iE,Z) " ,r)) - 



3,1=1 

where we have used concavity of the function 
and Jensen's inequality. Now, setting Dji := 
(N 3 (-Kg)iNi(-Kg)) H s, where ( • , • ) HS denotes the 
Hilbert-Schmidt inner product, and using Lemma 14.31 we 
obtain 



ED jt (U) < tr(A^(7r e )M(^)) = D jt . 



(7) 



It is obvious that Lji < Ljj and Lji < La hold. Moreover, 
the Cauchy-Schwarz inequality for the Hilbert-Schmidt inner 
product justifies the inequality Dji < max{Djj, Du}. 
Therefore, an application of Lemma 14/21 allows us to conclude 
from © that 

N 

E(F c>e (7r s ,MoU))>tr(M(7rg))-2j2Vkn]\W3^Q)\\2, 

3=1 

which is what we aimed to prove. □ 

B. Projections and Entanglement Fidelity 

Lemma 4.4: Let p G S(TL) for some Hilbert space H. Let, 
for some other Hilbert space K, A € C(H, K.), T> G C(/C, H), 
q G B(K) be an orthogonal projection. If for some e > the 
relation F e (p, T> o Q o A) > 1 — e holds, then 

F e (p,VoA) > l-3e. (8) 
The following Lemma 14.51 contains an inequality which will 
be needed in the proof of Lemma 14.41 

Lemma 4.5 (Cf. [4]): Let V G C(K,,H) and x x _L x 2 , z be 
state vectors, xi, X2 G /C, z G H. Then 

\(z,V(\x 1 )(x 2 \)z)\ < V\(zMV Xl )z)\-\(z,V(V X2 )z)l 

where V y := \y)(y\ for arbitrary state vectors y G H,JC. 

Proof of Lemma \4.4\ Let dim7Y = h, dim/C = k, \ip)(ij}\ € 
H a <£> H be a purification of p (w.l.o.g. H a — 'H). Set T> := 
idn a ® D, A := irf-« a <g> *4, g := ® g and, as usual, q 1 - 
the orthocomplement of q within Ha <8> /C. Obviously, 

= (xb,v(qA(W(mm + (</>,% ± i(|V>M)g x )V'> 
+(^,P(gi(|V)(^l)g ± )V'> + (V, ^(3 X -A(I^) (WW) 

> (i>,T>(qA(\i>)&\)q)i)) - 2\{^,V(qAm{^\)q x )^}\ 
= F e (p,VoQoA)-2\(i;,T>(qA(\^(^\)q ± )^)\. (9) 



We establish a lower bound on the second term on the RHS 
of ®. Let 

K-h 

^(|V>^l) = X;A i |o i )<a i | ) 

i=l 

where {ai, . . . , a K .h} are assumed to form an orthonormal 
basis. Now every ai can be written as a» = aiXi + fayi where 
Xi £ supp(<7) and £ supp(g ± ), i £ {1, k ■ h}, are state 
vectors and S C. Define cr := then 

K-h 

<r = Yl X j(\ a i\ 2 \ x i) ( x 3 I + a j0j\ x j)(yj I 
i=i 

+P 3 a*\y 3 )(x 3 \ + \P 3 \ 2 \y 3 ){ y] \). (10) 

Set X := \ (ip,V(qA(\tp)('ip\)q- L )ip}\. Then, using the decom- 
position (fTOb and the abbreviation 7\, := |u>)(w| (for w £ K. 
being a state-vector) 

X = \{i,,V{qaq^)\ 

K-h 

K-h 

.K-h K-h 

i=i j=i 

= F e (p,VoQoA)-F e (p,VoQ ± oA) 

< e. (11) 

Here, a follows from utilizing Lemma |431 b is an application 
of the Cauchy-Schwarz inequality and c is true by assumption. 
The inequality (TTTT) establishes ((H). □ 

V. Typical Projections and Operations 

At this point, we introduce the minimal amount of state- 
ments about typical projections and operations that is needed 
for the proof of Theorem 13.11 The reader interested in more 
details is referred to [3], [4] and references therein. The basic 
idea is that we throw away some non-essential information 
about an object and get nice estimates in return. 

Lemma 5.1: There is a real number c > such that for 
every two Hilbert spaces H, K, the following hold: 
There are functions h : N -> R + and ip : (0, 1/2) ->■ R + with 
h(l) \ and tp(6) \ (Setting d := dimW, k := dim/C, 
h and ip are given by h(l) := -plog(7 + 1) Vi € N and 
tp{8) := -Slog ^ V6 £ (0, 1/2)) such that 
A) For any p e"s(ft)) 5 £ (0,1/2), I £ N there is an 
orthogonal projection qgj £ B(H) lg ' 1 called frequency-typical 
projection that satisfies 

1) tr(p»'ga,,) > 1 - 2^ c&2 ~ h ^\ 

2) qsjp® l q&,l < 2- l ^-^q SA . 
The inequality 2) implies 

\\mp & qs,l\\l<2- 1 ^-^. 



B) For each M £ C{H, JC), 6 £ (0, 1/2), I £ N and maximally 
mixed state ttq on some subspace Q C H there is an operation 
M$,i £ C^(H® l ,JC® 1 ) called reduced operation with respect to 
M and ttq that satisfies 

3) tr(AM?rf Z )) > l-2-^ cS2 - h ^\ 

4) N't, i has a Kraus representation with at most ng,i < 
2 i('s^ s M)+v{&)+h{i)) operators. 

5) For every state p £ S(H® 1 ) and every two channels T £ 
C l (H® l ,H® 1 ) and C £ C l (JC® l ,H® 1 ) the inequality 
F e (p, C o M$j oI)< F e (p, C o M® 1 o 1) is fulfilled. 

VI. Proof of Theorem I3.1I 

We will restrict our proof to the case that 3 consists 
of finitely many elements. Also, we only prove the direct 
part Q(3) > lim^oo j inf^ ea max pe<s(w ®i) I c (p,M® 1 ). The 
converse part for finite 3 follows from an application of 
Lemma 6 in [5], In order to pass on to the case of general 3 
one approximates J by a sequence (3/)zgN of finite compound 
channels. It has to be taken care that the numbers A) := \3i\ 
increase subexponentially fast in I. All calculations are carried 
out in our papers [4] and [3]. 

Let us consider a compound channel given by a finite set 
3 := {Mi, . . . ,M N } C C{H,K) and a subspace Q £\H. For 
every I £ N, we choose a subspace Ti C Q® 1 . As usual, njr l 
and ng denote the maximally mixed states on T\, respectively 
Q while ki := dimJF; gives the dimension of Ti. 
For j £ {1,...,N},5 £ (0, 1/2), I £ N and states Mj{itg) let 
qj,s,i € B(JC)® be the frequency-typical projection of Mj(irg) 
and M 3t s,i be the reduced operation associated with M 3 and 
Trg as given in Lemma I5T1 

For an arbitrary unitary operation u £ B(TL® 1 ) we set 

1 N 

ri 3 \ uKS ■■= Q jt S,t o M m o U\ M l u i S := j^Yi^i,^ 

i=i 

1 - 

3=1 

Let U l be a random variable taking values in il(G ) which 
is distributed according to the Haar measure. Application of 
Theorem 14.11 yields 

EF c , e (^,M^ s ) > (12) 

AT 

-2Ev^^ll^(^)ll2, 

3 = 1 

where rij sj is the number of Kraus operators of M 3l s,l- Notice 
that Qj,s,i°M'j } s,i has a Kraus representation containing exactly 
n 3t s,i elements. We will use inequality ( fT2l in the proof of the 
following Lemma. 

Lemma 6.1 (Direct Part for maximally mixed states): Let 
3 = {M\, ...,Mn} C C(H, JC) be a compound channel and -Kg 
the maximally mixed state associated to a subspace Q C H. 
Then 

Q(3) > mmI c {-KQ,Mi). 



Proof. We show that for every e > the number 
miriA/' i g3 I c (irg,Afi) — e is an achievable rate for 3. 

1) If min^ea I c (ivg,Ni) — e < 0, there is nothing to prove. 

2) Let min^ea I c {^g,J^i) — e > 0. 

Choose 5 G (0, 1/2) and l eN satisfying 2 • y>(<5) + h(l ) < 
e/2 with functions tp, h from Lemma IBTTl 
For every I G N let the dimension of the subspace Ti C Q® 1 
be given by 

kl = \ 2 l ^ minMie3 7c ( Tr 6,A^)-e)j 

By S{itg) > I c (irg,Afj) (see [1]), this is always possible. 
We will now give lower bounds on the terms in (I12t . thereby 
making use of Lemma 15.11 

tr(A/"J(7r|')) > 1 - 2- 2-'< c52 -' i «). (13) 

A more detailed calculation can be found in [3] or [7]. Further, 
using that \\A + > \\A\\l + ||£f||| holds for nonnegative 
operators A,Be B(1C® 1 ) (see [7]), we get the inequality 

\Wls^f)\\l < 2- l( - s( ^^-^l (14) 

From (1121 1. fll31 l, dT4l > and our specific choice of fc; we get for 
every I > l 

^F c ^{-K ri ,N l ul s ) > 1 - 2- 2-'< c,52 -' i W) - 2Ny/2~ u / 2 . 

This shows the existence of at least one sequence (W z , TZ l )i^ 
of (Z, fc;)— entanglement transmission codes for J and 

liminf — logfc; = min I c (irg,Afi) — e 

l^oo I A/iG3 

as well as (using that entanglement fidelity is affine in the 
channel), for every I G N with I > Iq 

min F e {-K ri ,n l oM s oW l )>l-N\ei (15) 

where W'(-) = w l (-)w 1 *, w l G lt(0®') V? G N, and 

ei = 3 • (2 • 2-'( CI?2 ~' i «) + 2W2- (£ / 2 ). (16) 

For every j G {1, . . . ,7V} and Z G N\{1, . . . , h - 1} we thus 
have, by property 5) of Lemma 15.11 construction of N\ wj s , 
and equation (I15t . 

7^(^,7^ o Q iM oMfo W z ) > 

> F e (7r^ , ft' o Q^, o M 3 ,sj o W l ) 
= F e {Kr n K l oN l jw]J> ) 

> 1-N-ei. 

3 

By Lemma [4~4l this immediately implies 

min F e {-Kr v 1l l oMf oW ! ) > 1-JVe, V? G N\{1, . . .,Z -1}- 

A/,- S3 J 

Since e > was arbitrary, we have shown that 
mm^/ i& ^ I c (Trg,Afi) is an achievable rate. □ 



Lemma 6.2 (Compound BSST Lemma, Cf. [3]): Let 3 C 
C{7i,JC) be an arbitrary set of channels. For any p G 5(7i) 
let G B(H® 1 ) be the frequency-typical projection of p and 
set 

tt,,! := G S(W®')- 

tr (<7<M) 

Then there is a positive sequence (<5;);gN satisfying 
limi—Kx, <5; = with 

lim j vc£ U-k Su1 ,N® 1 ) = w£ I D {p,M). 

From Lemma 16.11 and the fact that 

Q(J & ) = IQ(3) (17) 

holds for every I G N we get independent from the value of Z 
and for every maximally mixed state 7Pf ( G <S(7i®') supported 
on a subspace JF; C 7i®' the inequality 

Q(J)>jmin; c (^,^'). (18) 

Let p G 5(7i) be arbitrary and (Si)i^, (TTs,,i)ieN as i n Lemma 
16.21 Then by ( fT8l and Lemma 16.21 we have 

Q(3) > lim y min J^Trs,,,,^ 81 ) 

(— too t A/i£j 

= min/ c (p,M). (19) 

Thus, > maXp £ s(H) min^ea ^c(P) M) has to hold. A 

second application of equation (TTTb and taking the limit Z — ► oo 
yields the desired result. □ 
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For the proof of Theorem 13.11 we only need one more 
ingredient, which is a generalization of the well known BSST 
Lemma of [2]: 



